WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCX 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 
G06F 17/30, H04L 29/06 



Al 



(11) International Publication Number: 
(43) International Publication Date: 



WO 98/33130 

30 July 1998 (30.07.98) 



(21) International Application Number: PCT/US97/23431 

(22) International Filing Date: 19 December 1997 (19.12.97) 



(30) Priority Data: 

08/788,971 



24 January 1997 (24.01.97) 



US 



(71) Applicant: MOTOROLA INC. [US/US]; 1303 East Algonquin 

Road, Schaumburg, IL 60196 (US). 

(72) Inventors: THERIAULT, Roger; 4763 Concordia Lane, Boyn- 

ton Beach, FL 33436-7701 (US). LOCKHART, Thomas, 
Wayne; 9400 Ryan Crescent, Richmond, British Columbia 
V7A 2H2 (CA). BATTIN, Robert, D.; 107 Bent Creek 
Court, Palatine, IL 60067 (US). 

(74) Agent: DUNLOP, Hugh, C; Motorola Inc., Intellectual Prop- 
erty Dept., 1303 East Algonquhi Road, Schaumburg, IL 
60196 (US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 

BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
GH, GW, HU, ID, IL, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW, MX, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, 
TM, TR, TT, UA, UG, UZ, VN, YU, ZW, ARIPO patent 
(GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian patent 
(AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent 
(AT, BE, CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, 
MC, NL, PT, SE), OAPI patent (BF, BJ, CP, CG, CI, CM, 
GA, GN, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 



(54) Title: PROXY HOST COMPUTER AND METHOD FOR ACCESSING AND RETRIEVING INFORMATION BETWEEN A 
BROWSER AND A PROXY 




(57) Abstract 

A query (160) is sent from a browser (100, 101) to a proxy (300) directed to an information source (140) in a networked data 
communications system. In one aspect, the query is modified by the proxy to provide a modified query and the modified query is forwarded 
to the information source (120). A response (170) is received at the proxy from the information source and forwarded to the browser. In 
another aspect the response is modified by the proxy to provide a modified response (370) which is forwarded to the browser. The proxy 
(300) has a file (341) of services established for at least one specified browser (100) and filter (302, 304) for filtering queries and responses 
from browsers. The filters are responsive to the file of services for filtering queries from the at least one specified browser. 



FOR THE PURPOSES OF INFORMATION ONLY 



Ccxies used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCX, 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


SZ 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TO 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


T6 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


Cdte d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Fedentfion 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







wo 98/33130 



PCT/US97/23431 



PROXY HOST COMPUTER AND METHOD FOR ACCESSING AND 
RETRIEVING INFORMATION BETWEEN A BROWSER AND A 

PROXY 

5 Field of the Invention 

The present invention relates to communications and more 
particularly with improved methods for accessing and retrieving 
information in a networked data communication system via an 
1 0 enhanced proxy. Separately and in addition it relates to a proxy for 
accessing and retrieving information between a networked data 
communications system and a browser. 

Background of the Invention 

15 

In a networked data communications system, users have access 
to terminals which are capable of requesting and receiving information 
from local or remote information sources. In such a system the 
terminal may be a personal computer (PC), a cellular phone, a mobile 

20 data terminal, a radio modem, a portable computer, a personal digital 
assistant (PDA), a pager, or any other similar device. The capability of 
the terminal to request and receive information may be provided by an 
application program or other such mechanism. A terminal provided 
with these capabilities is referred to as a browser. 

25 In such a system the information source may be a server (e.g., a 

host computer) coupled to a mass information storage device (e.g., a 
hard drive disk pack). The exchange of information (i.e., the request 
and receipt of information) between the terminal and information 
source is facilitated by a connection referred to as a communication 

30 channel. The communication channel may be physically realized via a 
wire (e.g., a telephone line), a radio signal (e.g., a radio frequency (RF) 
channel), a fiber optic cable, a microwave link, a satellite link or any 
other such medium or combination thereof connected to a network 
infrastructure. The infrastructure may be a telephone switch, a base 

3 5 station, a bridge, a router, or any other such specialized component, and 
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facilitates the connection between the browser and the network. 
Collectively, the interconnected group of terminals, physical 
connections, infrastructure and information sources is referred to as a 
network. 

5 The network itself may take a variety of forms. It may be located 

within a small, local geographic area, such as an office building, and 
consist of only a limited number of terminals and information sources. 
This type of network is commonly referred to as a Local Area Network 
(LAN). On a broader scale, it may be larger and support more users 

1 0 over a wider geographic area, such as across a city or state. This type of 
network is commonly referred to as a Wide Area Network (WAN). On 
an even broader scale the LAN and WAN networks may be 
interconnected across a country or globally. An example of a globally 
connected public data communications network is the Internet. 

15 To a user the Internet appears to be a single unified network, 

although in reality it consists of hundreds different types of computer 
platforms utilizing many diverse data communications technologies. 
The technologies are connected together in such a manner so they 
appear transparent to the user. This transparency is made possible 

20 through the use of a standard communications protocol suite known as 
Transmission Control Protocol /Internet Protocol (TCP/IP). 

Recently, Hypertext Markup Language (HTML) and Hypertext 
Transfer Protocol (HTTP) in particular have developed to make the 
World Wide Web very accessible. The exchange of information on the 

25 Web is further facilitated through hypertext documents. Hypertext 
documents are unique in that they use tags to define links (i.e., 
highlighted or underscored words or phrases) which, when selected, 
fetch the related information from within the same document or from 
a new document altogether. The links are defined using HTML which 

3 0 provides a document formatting method that adapts in a consistent 

manner to any computer on which it is displayed. HTML tags are used 
to define the various components of an ASCII text file which make up 
a hypertext document, including such things as formatting and linking 
to other documents. Tags which link documents on one Web 

3 5 information source to those on another do so by associating a Uniform 
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Resource Locator (URL) with the referenced information. The ability to 
link Web files of similar and /or differing formats to each other, and to 
link documents on other Internet sites, is a very powerful feature of the 
Web. 

5 The development of sophisticated browsers specifically for the 

Web, (i.e., browsers which utilize HTTP to request and receive HTML 
documents) have also helped to further increase its use and popularity. 
Standard web browsers, such as Mosaic™ or Netscape™, adhere to 
standard HTML and HTTP protocols and conventions. 

1 0 The appeal of the Internet is the large-scale interconnection of 

public and private networks. A concern exists, however, about "un- 
authorized" access from public networks to the attached private 
networks. This concern has resulted in the development of proxies. A 
proxy is a host computer or mechanism (usually an application 

1 5 program) on a network node which performs specialized functions on 
a network. One such function is to provide network security. Security 
is provided between a private and public network by requiring 
communications (i.e., information exchanges) to pass through the 
proxy. Another function of a proxy is to store or cache recently accessed 

20 information (i.e., copies of documents and images). If a browser desires 
information which is located outside the local network that is to say on 
an information source attached to an external network, 
communications pass from the browser through the proxy before going 
on to the external network. 

25 Thus a proxy may operate to deny access to a private network 

from a public network by not replying to HTTP commands received 
from the public network. 

Also a proxy may operate to deny access to specific Web sites, for 
example sites potentially offering undesirable information. This is 

3 0 achieved by maintaining a list of URLs at the proxy to which access is to 
be denied. HTTP commands which contain these URLs are not 
executed by the proxy and are responded to with a predefined message. 
It is also achieved by identifying a particular string in a HTTP 
command and sending the predefined message if such a string is 

3 5 identified. 
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While proxies address security problems, there are other 
problems which need to be addressed, such as those exacerbated by a 
low band- width connection to the browser, or access to undesirable 
information. 

5 There is a need for an improved method of accessing and 

retrieving information in a networked data communications system. 
There is also a need for an improved proxy for accessing and retrieving 
information between a networked data communications system and a 
browser. 

10 

Brief Description of the Drawings 

FIG. 1 is a system diagram of a typical prior art networked data 
communications system which includes a proxy; 
1 5 FIG. 2 is a system diagram of a networked data communications 

system including an enhanced proxy according to the preferred 
embodiment of the present invention; 

FIG. 3 is a flow chart describing the processes required to access 
information via an the proxy of FIG. 2; 
20 FIG. 4 is a flow chart describing the processes required to retrieve 

information via an enhanced proxy; 

FIG. 5 is a flow chart describing the processes required to 
determine the filtering services to be used by the enhanced proxy; 

FIG. 6 is a flow chart describing the processes required to establish 
25 the filtering services from the content and /or format of a query; 

FIG. 7 is a flow chart describing the processes required to establish 
the filtering services by querying a page of configuration information; 

FIG. 8 is a flow chart describing the processes required to establish 
the filtering services by fetching and responding to a configuration 
30 form; 

FIG. 9 is a flow chart describing the processes required to modify 
a query according to the established filtering services to restrict access to 
an information source. 

35 Detailed Description of the Drawings 
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FIG. 1 illustrates components of a typical networked data 
communications system. A browser 100 is connected to an information 
source 140 via a browser link 110, a network 120 and an information 
5 link 130. Together the browser link, network and information link 
form the communication channel. A local network 230, a local 
information source 240, a proxy 200, a proxy attachment point 210, and 
a cache 220 are shown. 

The browser 100 is connected to the local network 230 via the 

1 0 browser link 110 and exchanges information with attached local 

information source 240 via this link. The local information source 240 
retrieves the information from the attached local information storage 
device 250 and sends it back to the browser via the browser link 110. 
Information exchanges between the browser and the local information 

1 5 source do not pass through the proxy. 

Information exchange is effected by sending requests or queries 
and responses between the browser and the information source via the 
communication channel. For example in FIG. 1, a browser sends a 
query 160 over the communication channel for information which is 

20 located at the information source 140. The information source 140 
retrieves the information from the attached information storage 150 
and sends the response 170 (i.e., the retrieved information) back to the 
browser via the communication channel. 

The communication channel from the browser 100 to the 

25 information source 140 is established by passing through local network 
230 to the proxy 200. The browser connects to the proxy via an 
attachment point 210 (e.g., a port on the host computer). Queries 160 
from and responses 170 to the browser are via the attachment point. A 
cache 220, which may be used to store copies of recently received 

3 0 information responses from the information source, is attached to the 
proxy. 

Access to the Internet is often provided by an service provider 
which may charge the user a fee based on the amount (e.g., kilobytes) of 
data transferred. When a user clicks on an HTML link the physical size 
35 of the information requested is not provided in advance to the user. 
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As a result, users may unwittingly request and receive very large 
quantities of data (e.g. image files) in which they may have only a 
passing interest, or which may contain material they would like to 
filter out (e.g. pornography), and for which they may be charged. It 
5 would be desirable if, at the user's request, a proxy could reduce in some 
fashion the amount of the information before it is delivered to the 
user's browser. It would also be desirable if the proxy could provide to 
the browser an advance indication of the characteristics (e.g., size, 
content, etc.) of the information before it is transferred. 

1 0 Very little information about the capabilities of the browser is 

available to the proxy. For instance, a user may request an HTML 
document that contains tables or forms, but the browser may not be 
capable of displaying such data. In this a case the user would have to 
pay for data which would be unrecognizable by the browser and would 

15 be useless to the user. It would be desirable if the proxy could 
determine in advance the capabilities of the browser. 

The user has no means of requesting and selecting different 
services such as filtering from the proxy. For instance, a browser may 
contain the ability to compress and un-compress information and the 

20 user may want to use that capability when transferring HTML data. In 
another instance, the proxy administrator may want to provide a 
service to the users of the proxy which prohibits the access of minors to 
specific servers. At present there is no means for the user to request 
such special or enhanced services from the proxy. It would be desirable 

25 if the user could be provided with a means by which to select the proxy 
services they want to use. 

These problems are not of equal concern to users of a traditional 
wireline network and users of a wireless radio network. In particular, 
the above problems are exacerbated in a wireless network, where the 

3 0 bandwidth is limited, data transfer rates over the browser link are 
relatively slow, and where data transfer charges are comparatively 
high. Therefore, there remains a need for an improved means of 
accessing and retrieving data in networked data communications 
systems and more particularly from the Internet or Web that solves 

35 these and related problems. 
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The following description provides various methods of accessing 
and retrieving information via an enhanced proxy in a networked data 
commurucations system having an infrastructure, an information 
source, and a browser coupled by communication channels. A first 
5 aspect of the invention provides a method for accessing and retrieving 
information via an enhanced proxy over communication charmels 
established between a browser and an information source, including a 
means of modifying the information by the proxy according to user 
selected filtering services. 

10 In accordance with a second aspect of the invention, means are 

provided for the browser to establish and select the filtering services to 
be used by the proxy when processing queries and responses on behalf 
of that browser. This may be accomplished in several ways which may 
be used independently or in combination with each other. 

1 5 According to a first embodiment, a specific set of filtering 

services is associated with a particular proxy attachment point (e.g. a 
physical port number). A browser selects the set of filtering services 
associated with an attachment point by using that attachment point to 
communicate with the proxy. 

20 According to a second embodiment, the browser uses the 

content and/or format of the query to select the set of filtering services 
to be used by the proxy. For example, by using an enhanced format 
query the browser implicitly requests enhanced format responses. 

According to a third embodiment, the browser selects filtering 

25 services by querying specific information. For example, the proxy has 
associated with it an information source which contains Web pages 
that describe the filtering services provided by the proxy. A browser 
might request these pages in order to enable or disable particular 
filtering services. 

3 0 According to a fourth embodiment, the proxy has associated with 

it an information source which contains forms for selecting filtering 
services. The user selects the desired filtering services by fetching a 
form and completing and returning it to the proxy information source 
via the browser. 
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According to a further aspect of the invention, methods are 
provided for the proxy to filter the requested and /or retrieved 
information. Two classes of filtering services are employed by the 
proxy: filtering of the query received from the browser prior to 
5 forwarding it to the information source; and filtering of the response 
received from the information source before forwarding it to the 
browser. The techniques employed for both query and response 
filtering are devised to improve the efficiency of the use of the 
formatting and protocol conventions (e.g., HTTP and HTML). Efficient 

1 0 use of these conventions is important because the browser link may be 
a limited bandwidth resource, such as an RF chaimel. 

According to a still further aspect of the invention, a mechanism 
is provided whereby the proxy can send an advance indication of the 
characteristics of the information requested by the browser. 

1 5 Annotation of files size, content characteristics (e.g., text, image, video, 
etc.), content classification (e.g., suitable for children, adults only, etc.), 
may be passed on to the browser by the proxy. The proxy does this, for 
example, by fetching the information requested by the browser, 
analyzing it, creating a response containing the results of the analysis, 

20 and forwarding it to the browser. 

Additionally the proxy and the browser are able to process 
enhancements to standard formatting and protocol conventions (e.g., 
HTML and HTTP). For example, the HTML protocol may be extended 
by re-encoding the HTML tags to become more efficient, or the HTTP 

25 protocol may be extended to provide support for an "open" proxy 

session link for data transmission. Support of such enhancements is 
important as they would improve the efficiency of Web browsing in 
both wireless and wireline networks. 

Turning now to FIG. 2, there is generally depicted a networked 

3 0 communication system in accordance with a first embodiment of the 
invention. The system shown includes two browsers 100 and 101 (and 
will typically have many browsers) coupled to an enhanced proxy 300 
via browser links 110 and 111 respectively, a local network 230, and one 
or more proxy attachment points 310, 312. The enhanced proxy 300 is 

3 5 coupled to a proxy information storage 330, a proxy configuration 
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database 340, a cache 220, an information source 140 (via an 
information link 130) and a network 120, The information source 140 
is similarly coupled to the network 120 via the information link 130, 
and as well as to an information storage device 150. 
5 The enhanced proxy 300 is a processor and incorporates, in 

software, a receive module 301, a send module 303, a query filter 302, a 
response filter 304, a parser 344 and a database updater 345 as well as 
other elements which need not be described in detail. All the various 
couplings between the elements in enhanced proxy 300 are not shown 

10 in full, for the sake of simplicity of explanation. Query filter 302 and 
response filter 304 are coupled between receive module 301 and send 
module 303, but are illustrated aside from these modules for simplicity 
of explanation. Also, a direct link is selectively provided (not shown) 
from receive module to transmit module for selectively passing 

1 5 unfiltered queries and responses beteen these modules. 

Query filter 302 filters queries received from the browsers 100 
and 101. Response filter 304 filters responses from the information 
storage 140 to the browsers 100 and 101. The filters are implemented in 
software and are coupled to and responsive to files in the proxy 

20 configuration database 340, the database 340 having a file 341, 342, 
which is specific to each of the browsers 100 and 101. 

Parser 344 parses queries received at receive module 301 and in 
response to receipt of certain queries, parser 344 causes a look-up of 
information in proxy information storage 330 (or, through connections 

25 which are not illustrated, it causes a look-up of information in proxy 
configuration database 340 or in cache 220). In response to certain 
queries or forms received at receive module 301, parser 344 causes 
updater 345 to update database 340, as is described in greater detail 
below. Parser 344 also parses responses from information link 130. 

3 0 The proxy configuration database maintains the files 341, 342 etc., 

each file including a set of services associated with a corresponding 
browser 100, 101 for identifying the method of filtering to apply to a 
query as a function of the services associated with the particular 
browser. The proxy information source can contain a variety of 

3 5 information. The proxy information storage 330 contains messages or 
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pages for sending to a browser. It also contains forms for sending to the 
browser or configuration update pages which the browser can request. 
These details are described in greater detail below. A database updater 
345 is provided, coupled to the proxy information source and the proxy 
5 configuration database, which is primarily a "write" operator for 
updating the proxy configuration database 340 when certain 
interactions occur between the browser 100 and the proxy processor 320, 
described in more detail below. 

An optional non-proxy service peripheral device 355 is provided, 

1 0 such as a fax machine or paging message generator. 

In the preferred form: the browser 100 is a personal computer 
(PC) fitted with a radio frequency (RF) modem and communications 
software running a standard browser program such as Netscape (trade 
mark); the browser link 110 is a low speed data link such as a 19.2 

1 5 Kbit/second radio data channel; the local network 230 is a wireless RF 
network such as an ARDIS (registered trade mark), DataTAC (registered 
trade mark) or CDPD network; the proxy attachment points 310, 312 are 
ports on a host computer 290, otherwise referred to as a "server"; the 
enhanced proxy 300 is an application program running on the host 

20 computer 290; the proxy information storage 330 is a mass storage 

device such as a hard disk drive; the proxy configuration database 340 is 
a database application program such as Sybase (registered trade mark) 
running on the host computer 290; the cache 220 is a short-term high- 
speed temporary storage area accessible by the enhanced proxy 300; the 

25 information link 130 is a high speed data link such as a T-1 link; the 
network 120 is a packet switched public data network such as the 
Internet; the information source 140 is a server such as a web-server; 
and the information storage 150 device is a mass storage device such as 
a hard disk drive. Together the browser link 110, local network 230, 

3 0 proxy attachment point 310, enhanced proxy 300, network 120 and 
information link 130 form a communication channel between the 
browser 100 and information source 140 over which information may 
be exchanged. 

Information exchange between the browsers 100 and 101 and the 
3 5 information source 140 is facilitated by sending queries 160 and 
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responses 170 through the enhanced proxy 300 via the communication 
channel. The purpose of the enhanced proxy is to modify a query 160 
and /or response 170 according to a set of filtering services that the 
browser has established. Therefore, when a query 160 is received on the 
5 proxy attachment point 310, the query is channeled through the 

enhanced proxy 300 to be modified according to the filtering services 
established with the proxy for that browser. The modified query 360 is 
then forwarded by the enhanced proxy on to the information source 
140 via the communication channel. This process is described in more 

1 0 detail below with reference to FIG. 3. 

On receiving the modified query 360, the information source 140 
fetches the requested information from the attached information 
storage 150 device and sends the response 170 back to the enhanced 
proxy 300 via the communication channel. The response filter 304 of 

1 5 the enhanced proxy 300 modifies the response according to the set of 

filtering services established for the browser, possibly storing all or part 
of the modified response 370 on the proxy information storage 330 
device, and sends the modified response 370 back to the browser. This 
process is described in more detail below with reference to FIG. 4. 

20 While FIG. 2 depicts only one local network, one local 

information source, one local information storage, one enhanced 
proxy, one proxy configuration database, one proxy information source, 
one proxy information storage, one cache, one network, one 
information link, one information source and one information storage 

25 device, a practical system may include a plurality of each. Also a 
practical system may have fewer or more than two browsers, two 
browser links and two proxy attachment points. Although FIG. 2 
portrays a preferred system, the arrangement is expected to have equal 
applicability to both wireline and wireless data communications 

3 0 systems. Additionally the invention is expected to have applicability to 
data communications regardless of the specific form the browser or 
network may take. 

Referring to FIG, 3, the process of accessing information via an 
enhanced proxy is illustrated, beginning at step 400. At this step the 

35 enhanced proxy 300 receives a query 160 from a browser 100 at the proxy 
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attachment point 310. The query 160 contains a request from the 
browser to establish a communication link with the enhanced proxy 
300 via the proxy attachment point 310 and browser link 110. At step 
405 the enhanced proxy 300 responds by establishing the requested link 
5 with the browser. It should be noted that steps 400 and 405 are 

characteristic of the current industry standard methods of establishing 
communications between a browser and a proxy and as such, further 
details of these steps are not necessary for an understanding of the 
invention. The process of accessing information continues at step 410 

1 0 when the browser sends a query 160 for information via the browser 
link 110 to the enhanced proxy 300, and the proxy receives this request 
at the proxy attachment point 310. Each proxy attachment point 310, 
312 has a set of filtering services defined for it, which are described in 
greater detail below. When the browser 100 communicates with the 

1 5 enhanced proxy 300 via a specific proxy attachment point, e.g. 

attachment point 310, the filtering services which are defined for that 
attachment point will be applied to both the queries and responses for 
that particular browser. Processing continues at step 415 when the 
proxy determines the filtering services to be applied to the query. From 

20 step 415 the proxy proceeds to step 420. At step 420 the proxy has 

ascertained the set of filtering services which are to be applied to the 
query. Application of the filtering services to the query is carried out by 
the query filter 302 resulting in the modified query 360. 

The modifications to the query (described below) are dependent 

25 on the filtering services established for the browser as obtained from 

step 415, and that the modified query may or may not result in retrieval 
of the information as originally requested by the query in step 410. It 
should be noted that at the conclusion of step 420 the modified query 
360 will contain the URL of the appropriate information source (i.e., 

30 the URL of the originally requested information source, or some other 
URL as determined by the filtering services). 

By way of example, the case is considered where the URL of the 
modified query 360 indicates the information is located at information 
source 140. At step 425 the proxy requests a link to be established with 

3 5 the information source 140. Next the proxy proceeds to step 430 and 
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establishes the information link 130 between the enhanced proxy 300 
and the information source 140. Step 435 ensures that the proxy 
associates the browser link 110 established in step 405 with the 
information link 130. That is to say the proxy records the proxy 
5 attachment point (i.e., port number) and the browser ID which is 
cormected to that attachment point for this specific communication 
session. This mapping (i.e., the browser ID to proxy attachment point) 
is recorded in configuration database 340 so that the proxy may process 
multiple, concurrent browser sessions. Finally, at step 440 the 

1 0 enhanced proxy 300 forwards the modified query 360 via the 

information link 130 to the information source 140. Step 440 ends the 
processes associated with accessing an information source via a query. 

Before proceeding to descriptions of methods of modifying the 
queries and responses, there first follows a description of the process of 

1 5 retrieving the requested information with reference to FIG. 4. 

Beginning at step 500, the enhanced proxy 300 receives a response 170 to 
the modified query 360 from the information source 140. The response 
is sent via the information link 130 previously established at step 430 in 
FIG. 3. Once the enhanced proxy has received the response the link 

20 between the proxy and the information source no longer needs to be 
maintained and can be terminated. Step 505 terminates the 
information lirik 130 between the enhanced proxy 300 and the 
information source 140. Processing continues at step 510 when the 
proxy determines the filtering services to be applied to the response 170. 

25 The determining of the filtering services is described below. From step 
510 the proxy proceeds to step 515. At step 515 the proxy has ascertained 
the set of filtering services which are to be applied to the response. 
Application of the filtering services to the response produces the 
resultant modified response 370. Like the modifications to the query, 

30 the modifications to the response are dependent on the filtering 

services established for the browser as determined in the previous step. 
The modifications to the response are described below and may be 
relatively minor or fairly extensive. These modifications are carried 
out by the response filter 304 of the enhanced proxy 300 and stored on 
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the proxy information storage 330 by the proxy for future reference by 
the browser. This occurs at step 520. 

Once the modifications have been made to the response, step 525 
forwards the modified response 370 to the browser 100 via the browser 
5 link 110 as described above with reference to step 405 of FIG. 3. Once 
the browser receives the modified response the link between the 
enhanced proxy 300 and the browser 100 no longer needs to be 
maintained. Step 530 ends the process by terminating the browser link 
110, thus concluding the processes associated with retrieving 

1 0 information via an enhanced proxy. 

The process of determining the set of filtering services to apply to 
a query and/or response is more fully described by referring to FIG. 5. 
The filtering services and the associated filtering techniques are 
ascertained from one or more of four sources of information available 

15 to the enhanced proxy: from the browser identification (ID); from the 
proxy attachment point; from the format of the query and /or response; 
and from the proxy configuration database. 

The proxy configuration database 340 is implemented in such a 
manner so as to facilitate a table look-up or other similar mechanism 

20 which maps the proxy attachment point and /or the browser ID to the 
filtering services defined in the database for that browser. 

Referring now to FIG. 5, the process of determining the filtering 
services begins at step 600. At step 600 the proxy determines the proxy 
attachment point (i.e., port number) to which the browser is connected 

25 for this specific communication session. Next the proxy proceeds to 

step 605 where the enhanced proxy 300 uses the proxy attachment point 
310 as a parameter to look up the filtering services in the proxy 
configuration database 340 defined for that attachment point. Having 
retrieved the filtering services for the proxy attachment point, the 

30 proxy proceeds to step 610. At this step the proxy determines if any of 

the fetched filtering services may be altered by the user (i.e., the filtering 
services are user definable). If any of the filtering services are user 
definable, the proxy proceeds to step 615. If none of the filtering 
services is user definable, the proxy proceeds to step 630. 
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Assuming there are user definable filtering services, the proxy 
proceeds with step 615 and ascertains the identity of the browser (i.e., 
browser ID) from the query 160 and/or response 170. Methods by which 
the identity of a browser may be uniquely determined include: using 
5 the browser's Internet Protocol (IP) address; using an identification 
number assigned by the proxy service provider at the time of 
registration for the service; or any other such similar method. Next, at 
step 620, the browser ID is used as a parameter to look up the filtering 
services in the proxy configuration database 340 defined for that 

1 0 browser. Finally, at step 625 the filtering services defined for the proxy 
attachment point as fetched in step 605 are merged with the filtering 
services defined for the browser ID as fetched in step 620 to produce a 
set of filtering services for the browser ID. 

Proceeding to step 630, the proxy determines the type of query 

1 5 and /or response it has received at step 410. The query type is 

determined by the capabilities of the browser: those which adhere to 
standard HTML and HTTP protocols and conventions (e.g., as 
generated by Netscape (trade mark)) are referred to herein as standard 
browsers and generate standard format queries or responses; those 

20 which are characterized by enhancements such as those defined in 
herein are referred to as enhanced browsers and generate enhanced 
format queries or responses. 

At step 635 the proxy proceeds by checking the format type the 
query or response. If the query or response indicates the enhanced 

25 format, then the proxy proceeds to step 640, otherwise if the query or 
response is standard format, then the proxy proceeds on to step 650. 
From step 635 the processing proceeds to step 640 where the proxy 
fetches the filtering services for enhanced format queries or responses 
from the proxy configuration database 340. Next at step 645, the 

30 filtering services as fetched in steps 605, 620, and 640 are merged to 

produce a set of filtering services for the browser ID. Finally, at step 650, 
the complete set of filtering services defined for the browser ID (i.e., as 
determined at steps 605, 625, or 645 respectively) are returned to the 
calling procedure (i.e., either step 415 of FIG. 3 or step 510 of FIG. 4). 
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Thus it has been described how an aspect of the invention 
enables a user to request special or enhanced services from the proxy. 
Several specific means are now described by which to select the proxy 
services the user wishes to use with reference to FIGs. 6 through 8. 
5 Various specific methods of configuring the enhanced proxy to 

establish the filtering services to be supported by the enhanced proxy 
are described in these figures. Additionally various methods by which 
the browser may select the supported filtering services are also 
described, 

10 A first method of establishing proxy filtering services involves 

associating a set of filtering services with a particular proxy attachment 
point (i.e., a physical port number). In this method, the filtering 
services are established by the proxy service provider (PSP) which 
determines the type of filtering services that will be supported and the 

1 5 proxy attachment points 310 that will be associated with those services. 
For each proxy attachment point 310, an entry is created in the proxy 
configuration database 340 which defines the supported filtering 
services. The proxy attachment point 310 is used as a parameter to look 
up the entry in the proxy configuration database which defines the 

20 filtering services. Step 605 in FIG. 5 and illustrates this process. 

A specific embodiment of this method takes into consideration 
the capabilities of the browser (i.e., standard vs. enhanced) when 
providing different levels of filtering service for each proxy attachment 
point. For example, if the enhanced proxy has two proxy attachment 

25 points, then the filtering services for one proxy attachment point is, for 
example, defined for standard browsers, and the other for enhanced 
browsers. 

It should be noted that the enhanced proxy does not impose any 
particular type of filtering service. Rather, mechanisms are provided, 
3 0 described below, which enable the definition and association of filtering 
services with the proxy attachment points. Consideration of the 
capabilities of the browser is important as it allows the proxy service 
provider to choose filtering techniques which are optimized for a 
particular browser type. 
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For example, it does not make sense to implement data 
compression for all browser queries unless the requesting browser 
supports compression. To do so would result in transmission of un- 
decipherable information to the browser. The converse is also true, it is 
5 futile for the browser to request filtering services which it cannot 

handle. In such worst-case scenarios, wasted traffic communications 
(i.e., data which must be discarded) would not provide the desired 
improvement in channel utilization. The ability to define filtering 
services for proxy attachment points which are optimized according to 

1 0 the capabilities of the browser provides a simple solution to improving 
channel utilization. 

It should be noted that the information pertaining to the type of 
filtering associated with each proxy attachment point is supplied to 
subscribers by the PSP when they register for the service. The user 

1 5 selects the set of filtering services associated with a proxy attachment 
point (e.g. standard vs. enhanced) by using that attachment point to 
communicate with the enhanced proxy. The proxy attachment point 
determines the filtering services which are applied to the queries 
and /or responses of the browsers utilizing that particular port. Since 

20 the user is given information pertaining to the type of filtering service 
for each proxy attachment point, he is capable of changing the proxy 
attachment point that he is attached to as it suits his needs and 
purposes. 

A second method of establishing and selecting proxy filtering 
25 services utilizes the content and /or format of the browser query to 
select the set of filtering services to be used by the proxy. 

In the case of utilizing the content of the browser query, the 
enhanced proxy employs standard data processing parsing techniques to 
extract the content of the query. The extracted content of the query is 
3 0 then evaluated to determine if it is a request to select or modify the 
filtering services that are defined in the proxy configuration database 
for that browser ID. This type of query is referred to as a configuration 
query, and is directed to the enhanced proxy itself for the specific 
purpose of selecting and /or modifying the filtering services for a 
3 5 browser. 
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In the case of utilizing the format of the browser query, the 
enhanced proxy again employs standard data processing techniques to 
determine the format of the query. If it is determined that it is an 
enhanced format query then this implies that the proxy should 
5 formulates responses to the browser also utilizing enhanced 
formatting. 

The process of establishing and selecting proxy filtering services 
via the content and/or format of the browser query can be more fully 
appreciated by referring to FIG. 6. The processes described by FIG. 6 are 

1 0 inserted after step 410 and before step 415 of FIG. 3. At step 410 of FIG. 3, 
the query 160 has been received by the enhanced proxy 300 at the proxy 
attachment point 310. At step 700 of FIG. 6, the enhanced proxy 300 
utilizes standard data processing parsing techniques to extract the 
contents of the query 160 as received in FIG. 3 step 410. Next, at step 

1 5 705, the extracted query contents are evaluated to determine if this is a 
configuration query. If this is a configuration query, processing 
continues at step 710, otherwise processing continues at step 715. In 
order to retrieve the configuration information maintained by the 
proxy information storage 330, the query is modified at step 710 so that 

20 it is directed to the aforementioned information source. 

At step 715 the extracted contents of the query are again 
evaluated to determine if it is an enhanced format query (i.e., as 
generated by an enhanced browser). If it is an enhanced format query, 
processing continues at step 720, otherwise processing returns to 

25 continue at step 415 of FIG. 4. 

At step 720 the entry in the proxy configuration database 340 for 
the specific browser ID is modified so that all responses for this browser 
are formulated in enhanced format. The establishment and selection 
of filtering services via the content and /or format of a query is 

3 0 concludes at step 720, and processing returns to continue at step 415 of 
FIG. 3. 

A third method of establishing proxy filtering services involves 
querying a page of configuration information on the proxy 
configuration database to establish the filtering services for the 
3 5 particular browser. 
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In this method the proxy service provider pre-defines one or 
more HTML format configuration pages which reside on the proxy 
information storage 330, Each HTML configuration page contains 
information about the type of browser supported (i.e., standard vs. 
5 enhanced), and a description of the filtering services associated with 
and defined for that page. An entry corresponding to each 
configuration page is also created in the proxy configuration database 
340 which describes the filtering services for each such page. A list of 
the browser IDs that have requested specific configuration pages is also 

1 0 associated with each configuration database entry. A simple example of 
a configuration page would be one which was defined by the PSP to be a 
default configuration page. The default configuration page describes 
the basic or default level of filtering services which is to be used by all 
proxy subscribers until they otherwise initiate a change. 

1 5 Utilizing this method the user selects the filtering services to be 

used by sending a query 160 to the enhanced proxy 300 requesting a 
configuration page which is stored on the proxy information storage 
330. The enhanced proxy responds by returning the requested 
configuration page, noting which configuration page was accessed and 

20 the browser ID requesting the information in the configuration 
database 340. 

This method of configuring the proxy is based on the 
assumption that, given an initial default level of filtering service, the 
user will change the filtering services to suit his needs. On registration 

25 for the enhanced proxy service from the PSP, the user is provided with 
a description of the types browser support and filtering services for the 
default configuration page, as well as a description of the proxy's 
additional configuration pages and associated filtering services that 
may be obtained at the user's request. If the user wishes to change the 

3 0 default filtering services, he requests the additional filtering services by 
using his browser to access the desired configuration page containing 
the described filtering services. It should be noted that it is the act of 
requesting a specific configuration option page which causes the proxy 
to re-define the filtering services for that user in the proxy 

3 5 configuration database. 
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The process of establishing and selecting proxy filtering services 
by querying a page of configuration information on the proxy 
configuration database can be more fully appreciated by referring for 
FIG. 7. The process described here also begins at step 410 when the 
5 query 160 has been received by the enhanced proxy 300 at the proxy 

attachment point 310. The process illustrated in FIG. 7 is inserted after 
step 410 and before step 415 of FIG. 3. 

At step 800 of FIG. 7 the enhanced proxy 300 determines which 
configuration page has been requested by the browser. Proceeding to 

1 0 step 805 the proxy retrieves the requested configuration page from the 
proxy information storage 330. Next, at step 810, the browser ID is 
determined. As mentioned previously, a browser request for a 
particular configuration page will cause the proxy to re-define the 
filtering services for that browser in the proxy configuration database 

1 5 340. This is accomplished in step 815. Next, at step 820 the proxy 

prepares a response containing the retrieved configuration page to be 
sent to the browser, as described above with reference to FIG. 4. 

A fourth method of establishing proxy filtering services involves 
a fetching and responding to a configuration form located on the proxy 

20 information source to establish the filtering services for the particular 
browser. As for the method described with reference to FIG. 7, the 
proxy service provider pre-defines an HTML configuration form which 
resides on the proxy information storage 330. The HTML configuration 
form contains information about the types of browsers supported (i.e., 

25 standard vs. enhanced), and a description of the filtering services 
offered. However, this method contrasts with that described above 
with reference to FIG. 7 in that a comprehensive description of all the 
filtering services supported by the proxy are located on a single HTML 
configuration form rather than on multiple HTML configuration 

3 0 pages. 

For each subscriber (i.e., browser ID) of the proxy service, an entry 
is created in the proxy configuration database 340 which describes the 
filtering services selected for that browser ID as are described by the 
options chosen on the configuration form. When a user subscribes to 
3 5 the proxy service he is provided with an initial default level of filtering 
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service, and is entered as such in the proxy configuration database 340. 
The default level of filtering service is given on the configuration form 
via a pre-defined series of options which are entered on the 
configuration form. This level of filtering service is used by all 
5 browsers until they initiate a configuration change. 

Like the method previously discussed with reference to FIG. 7, 
this method is based on the assumption that, at registration the 
subscriber is given instructions on where and how to obtain the 
configuration form (i.e., given the URL for accessing the proxy 

1 0 information storage 330). The user initiates a change in his level of 

filtering service by requesting and fetching the configuration form from 
the proxy information storage 330, completing the form (i.e., he 
responds to the questions posed regarding the capabilities of his 
browser, the filtering services desired, etc.) and returning the completed 

1 5 configuration form via his browser to the enhanced proxy 300. The 
procedures used to facilitate this process are already described above 
with reference to FIG. 2. 

The process of selecting the filtering services via the completed 
configuration form can be more fully appreciated by referring to FIG. 8. 

20 The process begins at step 900 when the completed configuration form 
(i.e., query 160) has been received on the browser link 110 at the proxy 
attachment point 310. Next at step 905 the enhanced proxy determines 
if the query 160 contains an HTML form by checking for HTML 
formatting commands that denote an embedded form. Proceeding to 

25 910 the proxy checks if this is it's configuration form. If it is, then step 
915 follows. If it is not a configuration for the forms configuration 
procedure ends and the proxy continues on with further processing of 
the query. Having received a configuration form, processing continues 
at step 915. Here standard data processing techniques are employed by 

3 0 parser 344 to extract from the completed form the browser ID and the 
filtering services selected. At step 920 the extracted information is 
validated. For example, the browser ID verified as being a legitimate 
proxy service subscriber. At step 925 the validity of the extracted 
information is checked. If the extracted information is valid, 

35 processing continues to step 930. Step 930, for the browser ID 
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determined from the configuration form, the proxy configuration 
database 340 is updated with the filtering services as extracted from the 
configuration form. At step 935 a response is prepared for the browser 
which will contain confirmation of the newly defined filtering services 
5 as contained in the proxy configuration database. Processing continues 
at step 945. 

If, at step 925 the extracted information is not valid, a response is 
prepared for the browser which indicates the errors that were detected. 
Processing continues at step 945 where the response is returned to the 

1 0 enhanced proxy which will use the procedures described above with 
reference to FIG. 4 to return the response to the browser. 

This particular method is generally only appropriate for forms- 
compatible browsers. If the browser does not support forms, one of the 
previously mentioned methods is utilized. 

1 5 While four different means of establishing filtering services on 

the proxy have been presented, the methods presented here are not 
mutually exclusive of each other. Rather, the configuration methods 
described for the enhanced proxy have been created so that they can 
used alone or in combination with each other as determined by the 

20 proxy service provider. 

There now follows a description of the modifying of a query 
received from the browser prior to forwarding it on to the information 
source. (Below is described the modif3ang of the response received 
from the information source before forwarding it on to the browser.) 

25 Employment of query modifying does not preclude the use of response 
modifying and vice versa. Both classes of modifying are designed to be 
used as determined by the enhanced proxy according to the filtering 
services defined in the proxy configuration database for the specific 
browser ID. 

30 The specific modifying techniques presented here are devised to 

improve the efficiency of the use of the formatting and protocol 
conventions (e.g., HTTP and HTML). Efficient use of these 
conventions is important because the browser link may be a limited 
bandwidth resource such as an RF channel. 
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For a network operator employing an enhanced proxy, an 
increase in channel efficiency means that a greater number of 
subscribers can simultaneously access the network which in turn 
means improved network operator revenue. For the subscriber, an 
5 increase in channel efficiency results in a corresponding improvement 
in access time and cost. Such improvements lead to improved 
customer satisfaction, which in turn also serve to increase the revenue 
of the network 

To explain query modifying, there now follows a brief 
1 0 description of the format and structure of the Uniform Resource 
Locator (URL). The purpose of a URL is to indicate where a file is 
located on the Web. The structure of a URL is based on UNIX file 
description conventions and consists of four parts: the protocol, the 
host name, the path and the file name. For example: 
1 5 http://www.mot.com/General/prodport.html 

is the location of the Motorola World Wide Web page. In this example 
the protocol is "http'', the host name is "www.mot.com'% the path is 
"/General/" and the file name is "prodport.html". 

The protocol determines the type of access tool or 
20 telecommunications software that is required by the browser to fetch 

and read the files stored at the information source. Protocol descriptors 
commonly encountered include email, ftp, gopher, http, telnet to name 
just a few. The host name is often commonly referred to as the domain 
name. This descriptor indicates the name of the machine on which the 
25 information is located. The most common extensions encountered are: 
.com which indicates a commercial organization; 
.edu which indicates an educational institution, and 
.org which indicates a non-profit organization. 
Frequently the host name includes a 2-character country code suffix, 
30 which indicates the country in which the host resides. The path 

indicates in which directory on the host machine the file is located, 
while the file name indicates the specific file in which the information 
is found. 

Turning again to the instant invention, techniques which the 
3 5 enhanced proxy may employ for query modifying include, alone or in 
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combination, restricting access to an information source based on the 
protocol contained in the query and preventing access to specific 
information sources associated with the browser as being restricted 
sources. 

5 Access to an information source may be restrictedaccording to 

the protocol contained in URL of the query. For example, the proxy 
may be configured to deny access to information sources of a specifc 
type such as gopher, ftp (file transfer protocol) or email. An advantage 
of such a restriction is that the user may not know what type 

1 0 information is being requested and may not wish to initiate a query 
which would cause the download of an exceptionally large file. 

Access to specific information sources associated with the 
browser is restricted by, for example, denying access by a specific browser 
to specific URL sites or classes of URL sites. To accomplish this, the 

1 5 proxy configuration database 340 maintains lists of prohibited sites for 
different browsers or groups of browsers. A browser can make 
recommendations to add restricted sites to the block, on the basis of 
specific pages or entire domains. Restricted lists are maintained by the 
proxy centrally for all proxy subscribers. Further details of this process 

20 are illustrated in the flow diagram of FIG. 9. 

In step 1000 of FIG. 9, the contents of a query are extracted to 
obtain URL reference information. In step 1005, protocol restrictions 
for the particular browser ID are fetched from the proxy configuration 
database 340. If, in step 1010, the extracted URL protocol matches a set 

25 of browser URL restriction parameters, step 1015 causes a response to be 
formulated (in proxy information storage 330) indicating that access 
through the URL protocol is not supported by the established 
configuration services. If, in step 1010, the extracted URL protocol does 
not match the restrictions, step 1020 causes the URL restrictions for the 

30 particular browser ID to be fetched from the configuration database 340. 
If, in step 1025 the extracted URL hostname, pathname and /or filename 
matches the browser restriction classes, step 1030 causes a response to be 
formulated indicating access to the host/path/file is denied according to 
the browser restriction classes established for that browser. 
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As a further example of modification of a query, at step 420 the 
proxy compares the URL of the query 160 to its restricted site list to 
determine if access is permitted. If access is denied, the query is 
modified to reflect the denial of access to the information source and to 
5 re-direct the query to the proxy information storage 330 rather than the 
denied information source, to retrieve a pre-stored message, for 
example a message informing the browser of the denial of access and 
reasons for derual, or other information. 

There now follows a description of modification of a response to 

10 a query by the proxy. Modifications are classed as (i) modifications to 
responses which do not significantly affect the visible content of the 
response; (ii) modifications to responses which alter the visible content 
of the response; (iii) modifications to responses which visibly alter the 
content of the response and include links to the content prior to 

1 5 modification; and (iv) modifications to responses which visibly alter 
the content of the response and include links to other services. The 
first and second classes of modification deliver a full or partial but 
complete response with internal modifications. The third class of 
modification delivers an incomplete response. The former will be 

20 referred to as "internal" modifications and the latter will be referred to 
as "disjunctive" modifications. The fourth class of modification 
delivers an enhanced service and the visible altering of the content is 
subsidiary to the provision of the additional service and is, at a 
minimum, no more than provision of an indication of the availability 

25 of the additiorml service. Common to the third and fourth class of 

modification is the insertion of a link to the additional information or 
service. 

An example of an internal modification is the suppression of the 
display of in-line graphic images which are frequently included in 
30 HTML pages. The proxy modifies the response so that the images are 
replaced by a textual reference to the name and /or size of the image 
which was suppressed. Such a modification is visible to user of the 
recipient browser. 

An example of a disjunctive modification to a response occurs 
3 5 when the filtering (i.e. modifying) services indicate that the proxy 
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should impose a limit on the number of bytes which may be sent a 
response to a browser. In such a case, the proxy splits the page of HTML 
information into smaller sub-components and modifies the response 
to include a textual reference indicating the size in bytes of the 
5 remaining sub-components. The modified response includes part of 
the HTML page and the textual size reference to the maximum number 
of bytes permitted for that particular browser. 

Techniques that the proxy optionally employs for response 
modifying include any of the following alone or in combination with 
1 0 each other. 

The three classes of modification referred to are now described in 
greater detail. 

A. Modifications to responses which do not significantly affect the 
15 visible content of the response. 

This class of modifications removes redundant information or 
information which is commentary in nature and is not normally 
visible 

20 to the user. There now follows four examples. 

i) Remove comment tags. For example, HTML comment tags <!— 
... --> and specialized informational tags, e.g. <meta>, <nextid> etc. 
These can be removed by the following algorithms. 
25 1. A high level "Remove informational tags" algorithm: 

1.1 Fetch the 'list of tags to strip" for this browser. 

1.2 Scan the response for "start of tag" characters, and 
for each found: 

1.2.1 If the tag name matches a tag in the "list of 
3 0 tags to strip", 

1.2.1.1 Delete all characters until the 
matching "end of tag" character. 



35 



2. A more detailed "Remove informational tags" (without 
embedded tags): 
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2.1 Set STATE to "not in tag to strip". 

2.2 For each character of the HTML response: 
2.2.1 If STATE is "not in tag to strip", then 

2.2.1.1 If the character is a start of tag (i.e. '< ), 
5 then 

2.2.1.1.1 For each element of the 
"tags to strip list": 
2.2.1.1.1.1 If the tag in the 

response matches the tag 
10 in the list, 

2.2.1.1.1.1.1 Set STATE = 
"in tag to strip", and 

2.2.1.1.1.1.2 Break out of for 
loop. 

1 5 2.2.1.1.2 If STATE is still "not in tag to 

strip", then 

2.2.1.1.2.1 Output the 
character, (i.e. '<'.) 

2.2.1.2 Else the character is not a start of tag, 
20 so output it. 

2.2.3 Else if STATE is "in tag to strip", then 

2.2.3.1 If the character is an end of tag (i.e. *>'), 
then 

2.2.3.1.2 Set STATE = "not in tag to 
25 strip". 

This algorithm can easily be adapted to handle the stripping of tags 
with embedded tags. 

30 ii) Remove unsupported or unwanted features. Many mobile 
browsers may not support all HTML options. For example, the 
<SCRIPT> and <STYLE> tags could be deleted. 



iii) Remove unnecessary markup tags. Some embedded document 
35 markup and control language instructions are redundant. For 
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example, in HTML an end-of-paragraph tag (i.e., </P>) can be removed 
where it is followed by a new paragraph symbol (i.e., <P>), separated by 
only white space. There are other examples of paired tags where a 
closing tag can be 
5 implied by a subsequent opening tag, such as <TH></TH>, 
<TD></TD>, and <TR></TR>. 

iv) Shorten link URL addresses. Often documents contain links to 
documents on the same site. The URLs specified in those links may be 
1 0 full URLs or they may be relative to the current domain and or page. 
This method searches for and replaces full URLs with equivalent 
relative URLs which are shorter. 

B. Modifications to responses which alter the content of the 
15 response. 

This class of modifications changes the content of the response 

in a 

manner which may be visible to the user. These modifications are 
20 useful when the browser is incapable of displaying the effected format 
of content or the user has decided not to retrieve this type of content. 
Five examples now follow. 

i) Remove graphic images. All references to graphic images, e.g., 
25 HTML <img> tags, are either deleted completely or preferably replaced 
with a short text note. Note that <img> tags may contain an optional 
specification of suitable text to use (the "alt=" option). Alternately, the 
name of the image file can be used as replacement text. 

30 ii) Reduce the size of graphic images. Size reduction can be 
achieved 

in a number of ways. For example: by converting from color to 
greyscale; by re-encoding the image using a more efficient compression 
algorithm; by reducing the resolution of the image; or by a combination 
35 of these methods. Note that it may be appropriate to reduce the size of 
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graphic images to an upper bound limit which is chosen to be suitable 
for the communications protocol. These and other methods to reduce 
the size of images are per se known in the art, but here they are being 
applied as a specific response filtering technique. 

5 

iii) Remove background images. Background images can be 
removed simply by deleting the "background=" attribute from the 
<BODY> tag. 

1 0 iv) Censor by content. This filter searches the response for pre- 
defined words, e.g., "sex" or "Nazi" or and, if found, replaces the entire 
response with a pre-defined message indicating the original response 
was censored. Alternately, each occurrence of the restricted word is 
replaced with some pre-defined text, such as "expletive deleted". 

15 

v) Reformat tables. HTML formatted tables can, often times, be 
translated into a more efficient (although not perfect) format. For 
example, they can be translated into a pref or matted block (HTML's 
<pre>...</pre>) using line feed and tab characters to format. 
20 Furthermore, some basic browsers do not understand tables at all, but 
most understand preformatted text. For example, the simple HTML 
table (about 200 bytes): 
<TABLE> 

<TR> <TH>Name <TH> Rate<TH> Hours </TR> 
25 <TR> <TD>Tom </TD> <TD> 50 </TD> <TD> 40 </TD> 

</TR> 

<TR> <TD>Dick </TD> <TD> 25 </TD> <TD> 40 </TD> </TR> 
<TR> <TD>Harry </TD> <TD> 32 </TD> <TD> 22 </TD> 
</TR> 
30 </TABLE> 

could be replaced with (about 50 bytes) as follows: 
<pre> 

NameRate Hours 
Tom 50 40 

35 Dick 25 40 
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Harry 32 22 
</pre> 

This translation can be effected simply by scanning the response 
and when the tag <TABLE> is encountered, starting a preformatted 
5 block (i.e. output a <PRE>). Then every <TR> output a line feed, and 
every <TH> or <TD> output a tab. All other white space is ignored. 
Upon reaching the </TABLE>, end the preformatted block </PRE>. 
Care must be taken if dealing with nested tables or other fancy 
constructions. The number of tabs (or other white space) can be 
1 0 adjusted based on the originally specified table dimensions. Tables can 
also be more easily translated into a comma 
delimited format which is commonly used with spreadsheets. 

C, Modifications to responses which visibly alter the content of the 
1 5 response and include links to the content prior to modification. 

This class of modifications changes the content of the response 
and, depending on the filtering characteristics in effect, selectively 
causes a link to be added to a temporarily stored version of the content 
20 that was altered. Four examples follow. 

i) Replace an image with a link, such as a size-labeled link. For 
example, the following in-line image tag: 

<IMG SRC="skyline.gif" ALT="New York Skyline"> is replaced 
25 with: 

<A HREF="http://pl.proxy.net/tmp/1234/skyline.gif'> 
[New York Skyline][54K] </A> 
This has the effect of reducing the initial 46 byte IMG tag plus 
54,000 bytes of skyline.gif (or whatever the image file size actually is) 
3 0 with a 77 byte link. The user chooses whether or not to fetch the 54,000 
bytes of skyline.gif. Note that this method differs from known 
techniques of disabling images because (1) this mechanism includes an 
automatic link to the individual image file; and (2) the link 
information includes the size or approximate size of the image file. 
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As an improvement on this method of filtering and modifying, 
a configured size is predefined and only images that are larger than the 
configured size are replaced. The predefined size is included in the 
browser's configured filtering characteristics. 
5 In another variant, the original image is automatically replaced 

with a very small compressed version of the image which is set up as a 
link. In this case, the original HTML is translated into: 

<A HREF="http:/ /pl.proxy.net/tmp/1234/skyline.gif' > 
<IMGSRC="http://pl.proxy.net/tmp/1234/thumbnail 
10 of^skyline.gif" ALT="New York Skyline"> [54K1 </A> 

where "thumbnail_of_skyline.gif" is a very small (to a configured 
maximum) version of the original image file created by the enhanced 
proxy using traditional image compression techniques. The following 
algorithm can be used to accomplish this: 
15 1. Parse the response to be processed. 

2. Upon finding an IMG tag, fetch the specified image file. 

3. If the IMG tag is already in the scope of a link (an <A> in 
HTML), then note that fact for use later. 

4. Determine the minimum image size to replace. 

20 This could either be a constant for the browser, dictated 

largely by the minimum overhead to replace an image 
with a link, or a value dictated by the filtering 
characteristics configured for the browser. 

5. If the size of image file is less than this minimum size, 
25 goto step 7.0. 

6. The image file is large enough to process, so do the 
following: 

6.1 Determine a location for the image file in the proxy 
local storage. This location would be a file, the 

3 0 name of which could be based on the original name 

of the image file (from the SRC attribute) and /or 
the date and time and /or the browser being served. 
Identifying the file in this way could facilitate 
garbage collection of the temporary file storage. 

3 5 6.2 Save the image file in that location. 



wo 98/33130 



PCT/US97/23431 



32 

6.3 Replace the <IMG ...> tag with a "open link" tag (in 
HTML <A>), where the URL of the new link points 
to the filename containing the image just saved. 
Note that if the image had previously been within 

5 the scope of a link, as noted in step 2.0 above, then 

skip this step. 

6.4 Construct the label for the link (i.e. the text (or 
image) between the <A> and </A> tags). If a text 
label, then this can be constructed from the text 

1 0 specified by the ALT attribute of the IMG tag or 

from filename from the SRC attribute if there is no 
ALT attribute, and enclose this label in square 
brackets so the user can tell this a synthesized link. 
If configured for small image (i.e. thumbnail) link 

1 5 labels, then compress the image to the appropriate 

size, save a copy of it and insert an IMG tag 
referencing the small image. 

6.5 Insert the size of the image file (in K), enclosed in 
square brackets. 

20 6.6 Output the "close link'' tag, (in HTML ••</a>"). 

7. Return the (possibly modified) response. 



ii) Split a response into smaller pieces, and into each piece add a 
link to the next piece and/or the previous piece and/or the remainder 

25 of the response. 

This step can be performed after other steps described above. E.g., 
after other modifications have been made to the page, its size can be 
compared to some configured limit, and if above that limit, the page 
can be split into two or more pieces. Each piece is stored on the 

3 0 enhanced proxy's local information storage device, after it had been 
suitably modified. 

Note that it is preferable to split the page intelligently, so as not 
to breakup a tag or paired tags, (e.g. start of table and end of table). It is 
also desirable not to break the page in the middle of a paragraph or 

3 5 word. 
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In the simplest embodiment, it is not necessary to break the page 
into all of the appropriate sized pages, just the first piece and the rest, 
then when the rest is accessed (if it ever is) the same process will split it 
again should this be needed. An algorithm to split an HTML response 
5 is: 

1. Determine the current size of the response. 

2. Determine the maximum size response for this browser 
bv. for example, looking up the proxy configuration 
database entry for this browser, or by applying a maximum 

1 0 for this proxy. 

3. If the current size of the response is less than the 
maximum, 

3.1 Return the response so it can be delivered to the 
browser. 

15 4. Determine the maximum amount of the response that 

can be included in the first piece. This is the maximum 
size response, minus the size of the epilogue (link to rest 
of response and required ending of the response). 

5. Parse the initial portion of the response, up to the 

20 maximum size to be sent, and determine a good place to 

break the page. For example, the page should never be 
broken in the middle of an HTML tag, and should avoid 
where reasonable, breaking the page between starting and 
endings of paired tags. Some tag pairs, such as for defining 

25 links (<A>...</A>), must not be broken between starting 

and ending tag. However, it will usually be necessary to 
break between some starting and ending tag pairs. For 
example the <body>... </body> tags and also <table> ... 
</table>. This can be done by keeping track of the nesting 

3 0 depth of tags as the initial part of the response is being 

parsed and noting good places to split the response. 

6. Note the tags pairs that must be fixed up. 

7. Break the page as determined above. 

8. Prepare the "rest of response" piece, by: 
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8.1 Repair the noted broken tag pairs, by adding new 
"start tags". 

8.2 Prepend a suitable preamble (title etc.). 

8.3 Determine a suitable location on the enhanced 

5 proxy local storage for the following piece, and save 

it there. 

9. Fixup the "initiar' piece of the response, by: 

9.1 Repair the noted broken tag pairs, by adding new 
"end tags". 

1 0 9.2 Add a link labeled "more" (or some such) at a good 

spot near the end of the initial page, and address it 
to the just created following page. 

9.3 Append any trailing information that may be 
required or desirable (e.g., an advertisement for the 

1 5 filtering proxy service). 

9.4 Append any needed "end of response" syntax, such 
as </body>. 

10. Return the (now initial) response so it can be delivered to 
the browser. 

20 Other variants of splitting, such as completely breaking into 

palatable sections and creating links to the next and previous pieces, 
would be preformed in a similar, albeit more complicated, way. 

iii) Replace forms. Many smaller browsers do not support forms. 

25 Also forms can be very large, with large selection lists. Forms can be 

replaced in a number of ways. For example, the entire form (in HTML, 
between the <FORM> and </FORM> tag) can be replaced with a simple 
indication of the previous existence of the form, e.g.: 
<BR>[FORM]<BR>, which is more suitable if the browser does not 

3 0 support forms at all. 

Alternatively, the indication can be a link to the form that was 
retained on the proxy in a manner similar to the mechanism above 
used to split responses. The indication can also leave the text of the 
form, but delete the fields, so the user of the browser can have an 
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impression of what the original form contained. Again, this can either 
link to the complete form or not, as appropriate. 

iv) Removing header and footer "boilerplate'*. Boilerplate is 
5 recognizable standardized text and/or graphics which is common to 
different pages, for example successive pages. Many web pages contain 
boilerplate such as advertisements or other not very interesting 
information at both the top and bottom of the page. Usually these are 
demarked from the content of the response by a horizontal line (in 

1 0 HTML, an <HR>). This filter identifies suspected boilerplate and either 
deletes it, or replaces it with a link to a saved version. 

Some simple heuristics for doing this are to identify everything 
in the body if the response, above the first horizontal rule tag (i.e. 
<HR>), and everything below the lowest <HR>. This must be tempered 

15 by a rule such as "don't do this if the first (or last) <HR> is more than 

10% into (or before the end of) the response". Another useful rule is to 
look for areas between <HR> tags or the edges of the body, which 
contain no text, just images and formatting information. It is likely 
that these are advertisements. 

20 

D. Modifications to responses which visibly alter the content of the 
response and include links to other services. 

Two examples of these follow. 

25 

i) Insertion of a standard link to a page in proxy information 
storage 330. Such a page includes, for example, help desk information, 
notices, advertisements, etc. The standard link is applied to all 
responses when this filtering service is invoked. 

30 

ii) A particularly interesting and useful form of modification to a 
response which includes a link to other services is the invoking of a 
non-proxy service in peripheral device 355, for example a non-proxy 
communication service such as a fax or paging service. 
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The link is inserted into responses when the filtering service is 
invoked and, if activated by the user at the browser, the link causes a 
predefined query to be received at the enhanced proxy 300 and causes 
the non-proxy service to be invoked, e.g. causing a fax to be sent to a 
5 prearranged fax number. Many such links can be added, each causing a 
different fax page to be faxed to the prearranged number. Such a service 
can be used, for example, for facilitating the automated delivery of 
graphic images to a browser not having graphic image capability. 
Alternatively, it can be used for ordering off-line prints of material 

1 0 where the user does not have time to request each page of the material. 
This feature is especially useful where on-line time is limited or the 
link 110 is a slow one, e.g. a bandwidth-limited radio link. 

The link inserted by the enhanced proxy 300 in performing a 
modification of this nature is a link referencing the non-proxy service 

1 5 either directly as described above or via a form stored in proxy 

information storage 330. The form includes space for the user to insert 
a fax number or other information for the non-proxy service peripheral 
device 355 to perform its function. In this embodiment, the receipt at 
the enhanced proxy 300 of the completed form activates the peripheral 

20 device 355. 

Thus, an improved method of accessing and retrieving 
information in a networked data communications system has been 
described comprising: sending from a browser to a proxy a query 

25 directed to an information source; modifying the query by the proxy to 
provide a modified query; forwarding the modified query to the 
information source; receiving at the proxy from the information 
source a response to the modified query; modifying the response by the 
proxy to provide a modified response; and forwarding the modified 

3 0 response to the browser. 

The arrangement provides a user with a means of requesting 
and selecting different services such as filtering from the proxy. For 
instance, a browser may contain the ability to compress and un- 
compress information and the user may want to use that capability 

3 5 when transferring HTML data. In another instance, the proxy 
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administrator may want to provide a service to the users of the proxy 
which prohibits the access of minors to specific servers. 

This additional functionality is particularly useful where the 
browser link 110 is a wireless radio link, where the bandwidth is 
5 limited. As data transfer rates over a wireless radio browser link are 
relatively slow, it is advantageous to be able to select filtering services 
which reduce data transfer charges or avoid frustratingly large file 
transfers. 

Modifications to the arrangements described can be made within 

1 0 the scope of the invention. For example a user's browser may be 
required to know the attachment points associated with each set of 
filtering services they wish to use. This information may be provided 
via prior communication with the proxy administrator. 

It should be noted that the use of query filtering by the proxy 

1 5 does not preclude the use of response filtering, and vice versa. 

The enhanced proxy 300 is described as having a cache available 
in which to store the retrieved information. Optionally the user may 
refuse the information, request only part of the information, or request 
all of the information. The proxy will be able to respond to the user by 

20 supplying the information according to the user's response to the 

advance indication coupled with the filtering characteristics defined for 
that user in the proxy configuration database. In a wireless 
communications system, advance file characteristic indication provides 
an improvement to Web browsing by not wasting the limited 

25 bandwidth of the channel by sending information which may not be 
wanted in its entirety by the user. 

Enhancements to the proxy may include any of the following 
alone or in combination with each other: 

i) automatically identifying details of pages (e.g. images, boilerplate, 
30 etc.) to facility local semi-permanent caching; 

ii) automatically fetching cached HTML boilerplate; 

iii) establishing a permanent "open'' proxy session link for data 
transmission; 

iv) encrypting the data; 
35 v) compressing data; 
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vi) efficiently re-encoding HTML tags via an extension to the 
current HTML standard; 

vii) abbreviate URL prefixes; 

viii) fetching and caching URL references during an optimal time 
5 schedule; 

ix) automatically adjusting the degree of browser modifications 
based on an optimal time schedule; 

x) automatically adjust degree of browser modifications based on 
the response time to queries; 

1 0 xi) reformatting tabular data to achieve optimal data compression; 

Further modifications can be made within the scope and spirit of 
the invention. 
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Claims 

1, A method of accessing and retrieving information in a 
networked data communications system comprising, at a proxy: 
5 receiving from a browser a query directed to an information 

source; 

establishing a set of services associated with the browser; 
selecting a method of filtering to apply to the query as a function 
of the set of services associated with the browser; 
1 0 modifying the query according to the method of filtering to 

provide a modified query; 

forwarding the modified query to the information source; 
receiving from the information source a response to the 
modified query; and 
1 5 forwarding the response to the browser. 



2. The method of claim 1 further comprising: 
configuring the browser to direct the query to a specific 

attachment point of the proxy; and 
20 receiving the query at the specific attachment point. 

3. The method of claim 1 wherein, the step of modifying the query 
comprises filtering the query information by the proxy. 



25 4. The method of claim 1, wherein the set of services established 
for a selected browser are determined by at least one of: 

associating a class of service with a proxy host computer 
attachment point; 

associating a set of filtering methods based on capabilities of the 
3 0 selected browser; 

implying proxy parameters from a browser query; and 
configuring proxy parameters for the selected browser in 
response to a fetched form. 
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5. The method of claim 4, wherein the step of configuring the 
proxy parameters includes: 

identifying a particular type of configuration request from 
contents of the query; 
5 modifying a proxy configuration parameter database according to 

information contained in the query; and 

sending a status response back to the selected browser on success 
or failure of the configuration request. 

10 6. The method of claim 1, wherein the step of modifying the query 
as a function of the set of services established for a selected browser 
includes the steps of: 

providing an advance indication of characteristics of 
information requested; and 

1 5 altering the query by the proxy, 

7. The method of claim 6, wherein the step of providing an 
advance indication of the characteristics of information requested 
includes at least one of: 
20 armotating a physical size of a data file on the proxy; 

annotating file type characteristics; and 
annotating a file content classification on the proxy. 
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8. A method of accessing and retrieving information in a 
networked data communications system comprising, at a proxy: 

receiving from a browser a query directed to an information 
source; 

5 establishing a set of services associated with the browser; 

forwarding the query to the information source; 
receiving from the information source a response to the query; 
selecting a method of filtering to apply to the response as a 
function of the set of services associated with the browser; 
1 0 modifying the response according to the method of filtering to 

provide a modified response; and 

forwarding the modified response to the browser. 



9. The method of claim 8 further comprising: 

1 5 configuring the browser to direct the query to a specific 

attachment point of the proxy; and 

receiving the query at the specific attachment point. 

10. The method of claim 8 wherein, the step of modifying the 
20 response comprises filtering the response information by the proxy. 



11. The method of claim 10, wherein the step of establishing a set of 
services for the browser comprises at least one of: 

associating a class of service with a proxy host computer 
25 attachment point; 

associating a set of filtering methods based on capabilities of the 
browser; 

implying proxy parameters from a browser query; and 
cortfiguring proxy parameters for a browser in response to a 
30 fetched form. 
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12. The method of claim 11, wherein the step of configuring the 
proxy parameters includes: 

modifying a proxy configuration parameter database according to 
information contained in response to the fetched form; and 
5 sending a status response back to the browser. 

13. The method of claim 8, wherein the step of modifying the 
response as a function of the set of services established for the browser 
includes the step of providing to the browser an advance indication of 

1 0 characteristics of information retrieved. 

14. The method of claim 13 wherein the step of providing an 
advance indication of the characteristics of the information retrieved 
comprises at least one of: 

1 5 annotating a physical size of a data file on the proxy; 

annotating file type characteristics; 
annotating a file content classification on the proxy. 

15. The method of claim 8 wherein the step of modifying the 
20 response comprises splitting pages of information into smaller 

components. 

16. The method of claim 8 wherein the step of modifying the 
response comprises removing at least one of: text comments; forms; 

25 backgrounds; graphic images; embedded document markup and control 
language instructions; and boilerplate HTML commands. 

17. The method of claim 8 wherein the step of modifying the 
response comprises at least one of: 

3 0 reducing an image in size by a scalable factor; 

reducing an image in size by converting from color to gray-scale; 
reducing an image in size by re-encoding the image using a 
compression algorithm; and 

limiting a maximum size of a graphic image. 

35 
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18. The method of claim 8 wherein the step of modifying the 
response comprises replacing an image with at least one of: 

a textual reference to its size; 
a thumb nail reduction of the image; and 
5 a reference name to local cache of the image, 

19. The method of claim 8 wherein the step of modifying the 
response comprises reformatting tables. 

1 0 20. The method of claim 8 wherein the step of modifying the 
response comprises at least one of: 

minimizing universal resource locator (URL) prefixes; and 
restricting access to URL classes. 

15 21. A proxy host computer for accessing and retrieving information 
between a networked data communications system and a browser, the 
proxy host computer comprising: 

a file of services established for at least one specified browser; and 
a filter for filtering queries from browsers, the filter being 
20 responsive to the file of services for filtering queries from the at least 
one specified browser. 

22. The proxy host computer of claim 21, further comprising a proxy 
configuration database. 

25 

23. The proxy host computer of claim 22, further comprising look- 
up means for looking up parameters in the proxy configuration 
database according to information contained in a query. 
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24. A method of accessing and retrieving information in a 
networked data communications system having a proxy comprising: 

recording at the proxy a set of services associated with a browser; 

sending a query from the browser to the proxy, the query being 
5 directed to an information source; 

selecting a method of filtering to apply to the query received 
from the browser as a function of the set of services associated with the 
browser; 

modifying at least one of the query and a response to the query 
1 0 according to the method of filtering; and 

sending from the browser to the proxy a request to change the set 
of services associated with the browser. 



25. The method of claim 24 wherein the step of sending the query 
1 5 comprises accessing a configuration page at the proxy. 

26. The method according to claim 25, comprising the step of re- 
defining the set of services associated with the browser upon receipt at 
the proxy of a request to access the configuration page at the proxy. 



20 



25 



30 



27. The method of claim 24 wherein the step of recording comprises: 
registering the browser on the proxy; 

establishing an initial default level of services; and 
associating the initial default level of services with the browser. 

28. A proxy host computer comprising: 

a configuration database of services associated with browsers; and 
a database updater to update services associated with a browser 
upon receipt from the browser of a request for a configuration update. 

29. The proxy host computer of claim 28 comprising at least one 
configuration update page, wherein the database updater is responsive 
to a request from the browser for the at least one configuration update 
page to update the services for the browser. 



35 
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30. The proxy host computer of claim 29 comprising a plurality of 
configuration pages representing different sets of services. 

31. The proxy host computer of claim 28, wherein the configuration 
5 database includes associations between services and attachment points 

of browsers. 

32. The proxy host computer of claim 28 further comprising a 
wireless radio link connecting the proxy host computer to the browser. 

10 

33. A proxy host computer couplable to a browser to receive queries 
and couplable to an information source to deliver queries and receive 
responses, comprising: 

a configuration database of services associated with browsers; and 
1 5 proxy information storage having stored therein modifications 

to queries or responses, the modifications being accessible by the 
browser. 

34. The proxy host computer of claim 33 further comprising a 

20 wireless radio link connecting the proxy host computer to the browser. 

35. A method of accessing and retrieving information by a proxy in a 
networked data communications system comprising: 

associating a set of services with a proxy attachment point; 
25 receiving a query at the proxy attachment point from a browser; 

selecting a method of filtering to apply to the query as a function 
of the set of services; 

modifying the query by the proxy according to the method of 
filtering to provide a modified query; 
3 0 forwarding the modified query to an information source; 

receiving at the proxy from the information source a response to 
the modified query; and 

forwarding the response to the browser. 



wo 98/33130 



PCT/US97/23431 



46 

36. A method of accessing and retrieving information by a proxy in a 
networked data communicatioris system comprising: 

associating a set of services with a proxy attachment point; 
receiving a query at the proxy attachment point from a browser; 
5 forwarding the query to an information source; 

receiving at the proxy from the information source a response to 
the query; 

modifying the response by the proxy according to the set of 
services to provide a modified response; and 
1 0 forwarding the modified response to the browser. 

37. A method of accessing and retrieving information in a 
networked data communications system comprising: 

receiving from a browser at a proxy a query directed to an 
15 information source; 

modifying the query by the proxy to provide a modified query; 
forwarding the modified query to the information source; 
receiving at the proxy from the information source a response to 
the modified query; 
20 modifying the response by the proxy to provide a modified 

response; and 

forwarding the modified response to the browser. 

38. A method of accessing and retrieving information by a proxy in a 
25 networked data communications system comprising: 

receiving from a browser a query directed to an information 
source; 

forwarding the query to the information source; 
receiving at the proxy from the information source a response to 
3 0 the query; 

modifying the response by the proxy to provide a modified 
response by inserting a link referencing a non-proxy service; and 
forwarding the modified response to the browser. 

35 39. A proxy host computer comprising: 
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an attachment point at which a query is receivable; 

a link to an information source from which a response to the 
query is receivable; 

a filter operable on the response by insertion of a link referencing 
5 a non-proxy service; and 

a peripheral device which provides the non-proxy service and is 
activatable by the proxy host computer in response to a query received 
at the attachment point. 

1 0 40. The proxy host computer of claim 39 further comprising a 

wireless radio link connected to the attachment point for connecting to 
a browser. 
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